Towards easy prototyping of pattern mining problems

Authors

  • Frédéric Flouvat
  • Fabien De Marchi
  • Jean-Marc Petit

Abstract

In the last decade, many algorithms, benchmarks, and experimental studies have been devoted to pattern mining problems. In this paper, we focus on the special class of pattern mining problems known to be "representable as sets". In this setting, the main contribution of this paper is to exploit the common theoretical background of these problems from an implementation point of view, by proposing a library of efficient data structures and algorithms for pattern mining. Thus, every problem fulfilling the theoretical requirements can be implemented with minimal effort. According to our first results, the programs obtained using our library offer a very good tradeoff between performance and simplicity of development.
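
To illustrate why a shared theoretical background can translate into shared code, here is a minimal Python sketch of a generic levelwise search, a classical technique for this family of problems. It is not the authors' library or its API: the names `levelwise` and `min_support_predicate` are hypothetical, and the sketch assumes only that the problem supplies an anti-monotone predicate over sets of items (if a set satisfies it, so do all its subsets).

```python
from itertools import combinations
from typing import Callable, FrozenSet, Iterable, List, Set

Itemset = FrozenSet[str]


def levelwise(items: Iterable[str],
              predicate: Callable[[Itemset], bool]) -> List[Itemset]:
    """Return every non-empty itemset that satisfies `predicate`.

    Assumption: `predicate` is anti-monotone, so supersets of a failing
    itemset never need to be tested.
    """
    # Level 1: single items that satisfy the predicate.
    level: Set[Itemset] = set()
    for item in items:
        single = frozenset([item])
        if predicate(single):
            level.add(single)

    result: List[Itemset] = []
    while level:
        result.extend(level)
        # Candidate generation: join two sets of size k that differ by one item.
        candidates = {a | b for a in level for b in level
                      if len(a | b) == len(a) + 1}
        # Pruning: keep a candidate only if all its k-subsets were kept,
        # then test the problem-specific predicate.
        level = {c for c in candidates
                 if all(frozenset(s) in level
                        for s in combinations(c, len(c) - 1))
                 and predicate(c)}
    return result


def min_support_predicate(transactions: List[Set[str]],
                          min_support: int) -> Callable[[Itemset], bool]:
    """Hypothetical problem plug-in: frequent itemsets with absolute support."""
    def predicate(itemset: Itemset) -> bool:
        return sum(1 for t in transactions if itemset <= t) >= min_support
    return predicate


if __name__ == "__main__":
    data = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b"}]
    found = levelwise({"a", "b", "c"}, min_support_predicate(data, 2))
    print(sorted(sorted(s) for s in found))
```

In this sketch only `min_support_predicate` is specific to frequent itemset mining. Swapping in another anti-monotone predicate reuses the same search loop unchanged, which is the kind of reuse the abstract describes.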

Similar resources

The iZi Project: Easy Prototyping of Interesting Pattern Mining Algorithms

In the last decade, many data mining tools have been developed. They address most of the classical data mining problems such as classification, clustering or pattern mining. However, providing classical solutions for classical problems is not always sufficient. This is especially true for pattern mining problems known to be "representable as sets", an important class of problems which have many ...

An Embedded Domain Specific Language for Pattern Mining: a First Attempt

Logical query languages for pattern mining and their denotational semantics formally define what are interesting patterns in relational databases. The functional programming language Haskell provides an elegant framework to write compilers and interpreters for recursively defined languages with denotational semantics. In particular, it is especially good at embedding domain specific languages. This...

Intelligent Simulation Tools for Mining Large Scientific Data Sets

This paper describes problems, challenges, and opportunities for intelligent simulation of physical systems. Prototype intelligent simulation tools have been constructed for interpreting massive data sets from physical fields and for designing engineering systems. We identify the characteristics of intelligent simulation and describe several concrete application examples. These applications, whic...

Towards Simple, Easy to Understand, an Interactive Decision Tree Algorithm

Data mining is intended to extract hidden useful knowledge from large datasets in a given application. This usefulness relates to the user's goal; in other words, only the user can determine whether the resulting knowledge meets that goal. Therefore, a data mining tool should be highly interactive and participatory. This paper presents an interactive decision tree algorithm using visualization meth...

Soft constraint based pattern mining

The paradigm of pattern discovery based on constraints was introduced with the aim of providing to the user a tool to drive the discovery process towards potentially interesting patterns, with the positive side effect of achieving a more efficient computation. So far the research on this paradigm has mainly focused on the latter aspect: the development of efficient algorithms for the evaluation...

Publication date: 2007